Diffusion entropy analysis on the scaling behavior of financial markets 
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In this paper the diffusion entropy technique is applied to investigate the scaling behavior of 
financial markets. The scaling behaviors of four representative stock markets, Dow Jones Industrial 
Average, Standard&Poor 500, Heng Seng Index, and Shang Hai Stock Synthetic Index, are almost 
the same; with the scale-invariance exponents all in the interval [0.92, 0.95]. These results provide a 
strong evidence of the existence of long-rang correlation in financial time series, thus several variance- 
based methods are restricted for detecting the scale-invariance properties of financial markets. In 
addition, a parsimonious percolation model for stock markets is proposed, of which the scaling 
behavior agrees with the real-life markets well. 
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I. INTRODUCTION 

Analysis of financial time series attracts special atten- 
tions from diverse research fields for several decades. It 
can not only reveal the intrinsic dynamical properties of 
the corresponding financial markets but also provide us 
a clear scenario to construct dynamical models. Tradi- 
tional theories are constructed based upon some basic 
hypothesis, to cite examples, the stochastic processes in 
the markets and the homogenous property of the mar- 
kets, etc. The unexpected so-called rare events are ex- 
plained simply as the results due to accidents or exter- 
nal triggers|l|. The advancements in nonlinear theory 
lead a complete revolutionary in our ideas about financial 
markets. Instead of the deduced Gaussian distribution, 
empirical investigations in recent years indicate that the 
price return distribution of the financial time series gen- 
erally obeys the centered Levy distribution and displays 
fat-tail property, and the financial time series exhibits the 
scale-invariance behavior 0, 0, S IS IE • The nonlin- 
ear theory based analysis and dynamical models for the 
financial markets are the essential problems at present 
time. 

One of the important features of the financial time se- 
ries is the scale-invariance property, which can highlight 
the dynamical mechanics for the corresponding markets. 
Consider a complex system containing a large amount of 
particles. The scale-invariance property in the diffusion 
process of this system can be described mathematically 
with the probability distribution function as 

P(.T,t) = iF(5), (1) 
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where x is the displacements of the particles in the com- 
plex system and 5 the scale-invariance exponent. The 
theoretical foundation of this property is the Central 
Limit Theorem and the Generalized Central Limit The- 
orem 0, 0. For 5 = 0.5, the diffusion process is the 
standard diffusion and F{y) is the Gaussian function. 
And 6 ^ 0.5 exhibits the deviation of the dynamical 
process from the normal stochastic one. For a finan- 
cial time series, the delay— register vectors, denoted with 
{yk,yk+ir ■ -jUk+m-Ak = 1, 2, 3, • • •, TV - m -I- 1}, can 
be regarded as the trajectories of — m + 1 particles 
during the period of to m. By this way we can map 
a time series to a diffusion process, called overlapping 
diffusion process in this paper. An alternative solution 
is to separate the considered time series into many non- 
overlapping segments and regard these segments as the 
trajectories. 



In literature, several variance-based methods are pro- 
posed to detect the scale-invariance properties, such as 
the probability moment method the fluctuation ap- 
proach and the de-trended fluctuation approach [TH . etc. 
But these variance-based methods have two basic short- 
comings. One is that the scale-invariance property can 
be detected but the value of the exponent cannot be ob- 
tained correctly. The other is that for some processes, like 
the Levy flight, the variance tends to infinite and these 
methods are unavailabel at all. Although the infinite can 
not be reached due to the finite records of empirical data, 
clearly we can not obtain correct information about the 
dynamics under these conditions. Hence, in this paper 
we suggest the using of diffusion entropy (DE) technique 
to detect the scaling behavior of financial markets. 
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II. DIFFUSION ENTROPY TECHNIQUE AND 
DATA ANALYSIS 

To overcome the above shortcomings in the variance- 
based methods, the authors in reference designed the 
diffusion entropy analysis (DEA). To keep our description 
as self-contained as possible, we review the DEA method 
briefly. 

Filter out the trends in the original time series. Adopt- 
ing the traditional assumption generally used in the re- 
search filed of engineering, that a discrete time series vari- 
able consists of a slowly varying part and a fluctuation 
part [Hill, the index of a stock market reads, 

Cj=S,+Q,J^l,2,--- ,N, (2) 

where Q is the fluctuation with zero mean and fixed vari- 
ance. 

In the signal processing, the slow and regular variation 
Sj is usually called signal, which contains the useful in- 
formation. And the rapid erratic fluctuation Q is called 
the noise, which is regarded as perturbations containing 
only trivial information. In the DEA method, however, 
the scale— invariance will be detected from this fluctua- 
tion part. The "step smoothing" procedure is em- 
ployed to estimate the Sj part in our calculations, that 
is, regard the average of the segments as the trends, re- 
spectively. The final time series is regarded as a steady 
series, whose overlapping diffusion process reads, 

k+t 

a;fcW = I]0,fc = l,2,--- 1. (3) 

Consequently, the Shannon entropy can be defined as, 

/+CX; 
P(x,t)logio[P(x,t)]dx. (4) 
-OO 

A simple algebraic leads to, 

Sit)^A + 6\og,oit), (5) 

where 

A = - F{v)\og^^[F{v)]dx,y = -g. (6) 

The DEA method has been used to deal with many time 
series in different research fields, such as the solar in- 
duced atmosphere temperature series the intermit- 
tency time series in fluid turbulence 16], the spectra of 
complex networks|l7j. and the output spike trains of 
neurons [T^. 

In order to truly uncover the scaling behavior of fi- 
nancial market, we study both the domestic and over- 
sea stock markets. The domestic consist of Shang Hai 
Stock Synthetic Index (SS) with length N = 5412, 
and Hang Seng Index (HSI) with length N = 191076, 
while Dow Jones Industrial Average (DJIA) with length 
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FIG. 1: (Color online) DE results for four time series from 
different stock markets. The squares, circles, upward trian- 
gles and downward triangles denote the results for HIS, SS, 
S&P500 and DJIA, respectively. The corresponding scaling 
exponents are Shsi = 0.944 ± 0.004, 5ss = 0.923 ± 0.004, 
5s&P500 = 0.950 ± 0.005 and Sdjia = 0.948 ± 0.005, The 
solid line, whose slope is Sfit = 0.945, is presented as ref- 
erence. The occurrence of saturation regime caused by the 
"Step smoothing" is t = 365. Consequently, the fitting in- 
terval should be in the range oi t < y 13:]. In the fitting 
procedure is implemented in the interval of f G Jl, 100] 



N = 29627 and Standard&Poor 500 (S&P500) with 
length N = 5695 are constituted of the oversea. In 
Fig. 1, the DE technique is applied to analysis these 
real financial time series and we demonstrate the scaling 
behavior of different Indices are almost same, with the 
scaling exponents all in the interval [0.92, 0.95]; as show- 
ing below: 5ss = 0.923 ± 0.004, 5hsi = 0.944 ± 0.004, 
Sdjia = 0.948 ± 0.005, and Ss&cPboo = 0.950 ± 0.005. 
These results provide a strong evidence of the existence 
of long-rang correlation in financial time series, thus sev- 
eral variance-based methods are restricted for detecting 
the scale-invariance properties of financial markets. 

Herein it must be noted that the occurrence of satura- 
tion regime caused by the "Step smoothing" is i = 365. 
Consequently, the fitting interval must be only a lim- 
ited range of times, t < |-0], which we estimate to be 
t E [1,100]. 



III. THE MODEL 

There are many modelling methods to explain the ori- 
gins of the observed nonlinear scaling behavior of mar- 
ket price as emerging from simple behavioral rules of 
a large number of heterogeneous market participants, 
such as behavior- mind models [T^ I2(tI ]. dynamic-games 
models I^Lcellular- automata models[23L l2^. multi- 
agent modek]0, |2^, and so on. Here in this paper. 
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we consider a stock market model based on percolation 
theory^. Cont and Bouchaud(28i| successfully applied 
percolation theory to modelling the financial market (CB 
model), which is one of the simplest models able to ac- 
count for the main stylized fact of financial markets, e.g. 
fat tails of the histogram of log-returns. Up to now, 
the percolation theory is widely used in modelling stock 
markets,29, 30, 31, 32]. 

Based on percolation theory, our model incorporates 
the following components different from the original CB 
model: (1) The cluster, defined as a set of interconnected 
investors, grows in a self-organized process. (2) The ef- 
fect of " herd behavior" on the trade- volume is magnified 
step by step during the cluster's self-organized accumu- 
lating process rather than instantaneously formed like EZ 
model and its varietv(33. ,i34„ .35. .36j. (3) Some encoun- 
tering smaller clusters will form a bigger cluster through 
cooperating or one defeating the rivals. (4) An infinite 
cluster maybe exist without the need to tune p to Pc 
and its trade activity influence price fluctuation. (5)The 
number of investors participating in trading will vary dy- 
namically. Now let's see the model. 

A. Dynamic of investor groups 

Initially, M investors randomly take up the sites of an 
L * L lattice. Then for each cluster, a strategy is given: 
buying, selling, or sleeping, which are denoted by 1, -1 
and respectively. In reality, the circle of professionals 
and colleagues to whom a trader is typically connected 
evolves as a function of time: in some cases, traders are 
following strong herding behavior and the effective con- 
nectivity parameter p is high; in other cases, investors are 
more individualistic and smaller values of p seem more 
reasonable. In order to take the complex dynamics of in- 
teractions between traders into account, we assume that 
it undergoes the following evolution repeatedly: 

(l)Growth: most of investors would like to imitate 
the strategies which have been adopted by many others, 
which induces "herd behavior" occurring. In this sense 
the herd behavior is amplified. Specially, the affection of 
the herd behavior will be magnified gradually with the 
increase of the number of investors adopting this strat- 
egy, i.e., with the growth of the clusters. During cluster's 
growth, a number of new investors will be attracted by it 
and become its members. In other words, every cluster 
will absorb new investors with the probability 

Pd{T)=Pd{T-l)+k{NT~NT{T^l)), (7) 

where fc is a kinetic coefficient controlling the growth 
speed and Nt is a threshold parameter (It has been vali- 
dated that the value of the parameters k and Nt could be 
any value. These two parameters have no effects on the 
self-organization process of the clustersjs^)- Nt{t — 1) is 
the number of the agents along its boundary, defined as 
a set made up of agents which belong to a cluster and at 
least border on a site which isn't part of this cluster, at 



the last time step r — 1. The new participating investor 
will take up the empty sites around the old clusters and 
imitate the same strategy as that of it. The probabil- 
ity Pd is obviously limited to the range [0,1] so that we 
have to impose Pd — and Pd = 1, if the recurrence 
relationship Equ.7 gives values for < or > 1- 

(2) New cluster's birth: some investors will randomly 
and independently enter the market with the probability 
Pn- These investors don't belong to an arbitrary existing 
cluster and will take up the empty sites. 

(3) Cooperation: encountering clusters will operate 
cooperation and confliction between them. When their 
strategies are same, they are thought to cluster together 
to form a new group of infiuence. Or there would be con- 
fliction between them. The consequence of confliction is 
that losers would be annexed by the winner and that a 
new and bigger cluster whose strategy inherent the win- 
ner's will be formed. The probability of cooperation or 
confliction is as follow, i.e., some a cluster will cooperate 
with or defeat others with the probability 

Pm{k) kJl, (8) 

where |s^| is the size of k-th cluster at time r. 

(4) Metabolism: in reality, no matter how huge has 
the size of a group ever been it would collapse due to 
different influences such as government decision on war 
and peace. Some new clusters will come into the world in 
the wake of aging clusters' death. The probability with 
which an aging clusters will die is: 



where x and y is the width of this cluster occurring on 
the lattice in the x and y direction. Equ.(4) indicates 
that the probability with which a cluster disbands would 
increase with the cluster growth. Once a spanning cluster 
exists, it would surely die. When a cluster disbands, all 
its members would leave the market and the sites where 
the death cluster ever occupied will be taken up by new 
investors with the probability P„. Such occupied sites 
form a few new clusters. Every new cluster would be 
given a strategy randomly. 

Although each cluster could trade with others at ev- 
ery trading step, the evolution frequency of the network 
topology should not be so often. Thus, we assume that 
the network structure of the market composed by investor 
groups would evolve every N trading steps. With the 
evolution of this artificial stock market, the number of 
investors participating in trading isn't constant. The net- 
work will take on different structure; the affection of the 
herd behavior on the trade-volume is gradually magni- 
fied. Without any artificial adjustment of the connectiv- 
ity probability p to pc, spanning cluster may exist, whose 
activity would influence the price fluctuation. 
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FIG. 2: (Color online) Time series of the typical evolution of 
the stock price in the interval . The insect is the Dow Jones 
Industrial Average (DJIA) from 01-02-1940 to 12-31-1987. 

B. Trading rules 

Each cluster trades with probability a (called activity) ; 
if it trades, it gives equal probability to a buying or sell- 
ing with demand proportional to the cluster size. The ex- 
cess demand is then the difference between the sum of all 
buying orders and selling orders received within one time 
interval. The price changes from one time step to the 
next by an amount proportional to the excess demand. 
To explain the "volatility", Stauffer introduces the feed- 
back mechanism between the difference of the "supply 
and demand" and activity of the investors |23- Whereas 
in our model, the difference of the "supply and demand" 
not only affects the activity probability but also the prob- 
ability with which the active clusters choose to buy or 
sell. The probability a evolves following the Equ.(5): 

a{t) = a{t - 1) + lr{t - 1) + a, (10) 

where r is the difference between the demand and sup- 
ply, I denotes the sensitivity of a to r and a measures 
the degree of impact of external information on the ac- 
tivity. Each active cluster choose to buy or sell with 
probabilities ^a{t){l — ps{t)) and ^a{t)ps{t) respectively. 
For r > 0, Ps{t) ^ 0.5 + dir{t - 1), while for r < 0, 
Ps{t) = 0.5 -I- d2r{t — 1). According to Kahncman and 
Tverskyjs^, it is asymmetry that agents make their de- 
cisions when they are in face of gain or loss. When refer- 
ring to gain, most of the agents are risk adverse. On the 
contrary, they are risk preference. These determine the 
parameters di and d2 , representing the sensitivity of the 
agent's mentality to the price fluctuations and differing 
from each other. In our model we assume d2 = 2di. The 
difference between the demand and supply is: 

m 

r(t)=5^sign(s^)(|.^|)^ (11) 
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FIG. 3: (Color online) The DE result, the black squares 
denote the time series generated by the present model, with 
the scaling Smodei ~ 0.939, which is almost the same as that 
of DJIA denoted by red circles. 

where m is the total number of clusters occurring on the 
market. 7 measures the degree of impact of each cluster's 
trade- volume on the price, < 7 < 1 allowing for a 
nonlinear dependence of the change of (the logarithm of 
the) price as a function of the difference between supply 
and demandjs^. So the evolution of the price is: 

P^{t) ^Pr{t~ 1) exp(Ar(i)) (12) 



C. Simulation 

When proper initial condition and parameters have 
been chosen, the artificial stock market can generate its 
stock price. In Fig. 2 we present a typical simulation 
result about price time series generated by our model, 
which is rather similar to the real- life index (inset). The 
parameters used here are a(0) = 0.09, 7-(0) = 0.09, 
Pr{0) = 1, Pd{0) = 0.4, k = 0.0001, Nt ^ 50,1 = ^, 
L = 100, di = 0.00005, 7 = 0.78, P„ = 0.6, M = 100, 
N = 50. 

By numerical studies, we have demonstrated that this 
model exhibits the stylish facts with the price returns 
distribution is a Levy distribution in the central part fol- 
lowed by an approximately exponential truncation ^4^, 
and displays power-law avalanche size distribution that 
agrees with the real-life markets well|4lj. 

In succession, we investigate the scaling behavior of the 
stock price time series generated by the present model. 
One can see clearly in Fig. 3, the scaling of this arti- 
ficial stock market is Smodei — 0.939 ± 0.003, which is 
excellently agree with the real-life markets (see also the 
empirical data of DJIA for comparison). This numeri- 
cal results strongly suggest that the present model has 
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successfully reproduced some dynamical characters of re- 
ality. 

IV. CONCLUSION 

In summary, by means of the DE method we inves- 
tigate the scaling behavior embedded in the time series 
of four typical financial markets. The scale-invariance 
exponents are almost the same, being in the interval of 
[0.92,0.95]. This large deviation from the Gaussian dis- 
tribution reveals the strong correlations in the time se- 
ries. The present empirical study of the scaling behavior 
in real markets also provides a usable quantity to check 
the reliability of artificial market model. 



Consequently, we propose a parsimonious percolation 
model for stock market. Proper initial condition and pa- 
rameters can lead its stock price series being similar to 
the real-life index. Especially, the scaling behavior de- 
tected with the DE method agrees with the real-life fi- 
nancial markets very well. 
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